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Abstract 

One-sample and multi-sample tests on the concentration parameter of Fisher-von 
Mises-Langevin (FvML) distributions have been well studied in the literature. How- 
ever, only very little is known about their behavior under local alternatives, which 
is due to complications inherent to the curved nature of the parameter space. The 
aim of the present paper therefore consists in filling that gap by having recourse to 
the Le Cam methodology, which has been adapted from the linear to the spherical 
setup in Ley et al. (2013a). We obtain explicit expressions of the powers for the most 
efficient one- and multi-sample tests; these tests are those considered in Watamori and 
Jupp (2005). As a nice by-product, we are also able to write down the powers (against 
local FvML alternatives) of the celebrated Rayleigh (1919) test of uniformity. A Monte 
Carlo simulation study confirms our theoretical findings and shows the finite-sample 
behavior of the above-mentioned procedures. 

Keywords: concentration parameter, directional statistics, Fisher-von Mises-Langevin dis- 
tributions, Le Cam's third Lemma, uniform local asymptotic normality. 



1 Introduction 

The field of directional (circular in dimension k = 2, spherical in higher dimensions) statis- 
tics has become increasingly popular over the past decades, stimulated by the pioneering 
and seminal paper Fisher (1953). More recent cornerstone references are the monographs 
Fisher et al. (1987) and Mardia and Jupp (2000). This domain, which is particularly suited 
for modeling and explaining phenomena arising in earth sciences, meteorology, the study 
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of animal behavior, astronomy or neurosciences, to cite but these areas, is concerned with 
observations conceived as realizations of random vectors X taking values on the surface 
of the unit hypersphere S k ~ 1 := {v G M fc | ||v|| = 1}. 

By far the most popular and most used directional distribution is the Fisher-von Mises- 
Langevin (FvML) distribution (named, according to Watson 1983, after von Mises 1918 
for k = 2, Fisher 1953 for k = 3, and Langevin 1905 for general k), whose density is of the 
form (with respect to the usual surface area measure on spheres) 

/ K ,e(x) = c Kk exp^x'0), x G S k ~\ (1.1) 

where k > is a concentration parameter, 9 G S k ~ 1 a (spherical or directional) location 
parameter and where the normalizing constant Cfc jK is given by 

/KN*/2-l 1 

Ck > K ~ UJ T(k/2)I k/2 ^( K y 
with T(k/2) the Gamma function evaluated at k/2 and Ik/2-i( K ) the modified Bessel func- 
tion of the first kind and of order k/2—1. The FvML distribution is considered as the direc- 
tional analogue of the (linear) Gaussian distribution for purposes of mathematical statistics 
(see Schaeben 1992 for a discussion on directional analogues of the Gaussian distribution). 
This analogy is mainly due to the fact that the FvML distribution can be characterized 
by the empirical spherical mean 0Mean := Yli=i x */ll Yli=i x *ll> X i> ■ • • > X n G S fc ~\ as 
the Maximum Likelihood Estimator (MLE) of its (spherical) location parameter, similarly 
as the Gaussian distribution can be characterized by the empirical mean n _1 ^?=i x «i 
Xi, . . . , X n G as the MLE of its classical (linear) location parameter, a famous result 
due to Gauss. We refer to Duerinckx and Ley (2013) for a formal proof of this fact and 
for more details on directional MLE characterizations 1 . 

Due to its prominent role, the FvML distribution has received a lot of attention in the 
literature, and inferential procedures involving its concentration and location parameters 
have been extensively studied in the literature (see for instance Sections 10.4-10.6 in Mar- 
dia and Jupp 2000). In the present paper, the parameter of interest is the concentration 

1 It is interesting in this context to note that Gauss, in his manuscript "Theoria motus corporum 
coelestium in sectionibus conicis solem ambientium" of 1809, has defined the famous distribution named 
after him by searching for the probability law for which the sample mean is always the MLE of the 
location parameter, and that von Mises, in 1918, aiming at constructing a circular analogue of the Gaussian 
distribution, started precisely from this classical MLE characterization. 
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parameter k which regulates the probability mass in the vicinity of the modal direction 6. 
Besides the tests described in Mardia and Jupp (2000), hypothesis testing procedures deal- 
ing specifically with the concentration parameter can mainly be found in Stephens (1969), 
Larsen et al. (2002) and Watamori and Jupp (2005). Due to their efficiency properties, 
the proposed procedures are either likelihood ratio (in its basic and improved versions) 
or score tests. Even if the asymptotic theory of such tests has been well studied in the 
above-cited papers, little is known about their asymptotic behavior and power under local 
alternatives. This absence of result can certainly be explained by the curved nature of the 
parameter space M.q x S k ~ 1 . 

In this paper, our aim is therefore to fill this gap by providing explicit expressions of the 
powers of the most efficient tests for both the one-sample problem (Hq : h = kq for some 
fixed kq > versus : k ^ kq) and the multi-sample problem (7i^ om : K\ = . . . = K m 
for m > 2 and K\, . . . , K m > versus "H^ om : 31 < i ^ j < m ki 7^ Kj). We achieve 
this goal by combining the Uniform Local Asymptotic Normality (ULAN) property of 
the concentration-location FvML model (property we first establish) with Le Cam's third 
lemma. By doing so, we shall extend, for the FvML distribution, the ULAN property 
with respect to only the location parameter obtained in Ley et al. (2013a). As we shall 
see, although the ULAN property does not hold for k = 0, we are nevertheless able via Le 
Cam's third Lemma to write down the asymptotic powers of the classical Rayleigh test for 
uniformity, which is nothing but the score test for uniformity against FvML alternatives. 

The rest of the paper is organized as follows. In Section 2, we establish and prove 
the key ingredient of our calculations, namely the ULAN property of the concentration- 
location FvML model. In Sections 3 and 4, we write out the locally and asymptotically 
optimal tests for the one-sample and multi-sample problems, respectively. By construc- 
tion, these coincide with the score tests proposed in Watamori and Jupp (2005) which 
themselves are asymptotically equivalent (the difference is op(l)) to the likelihood ratio 
tests under the null (and therefore also under contiguous alternatives). In each section, 
we then study the asymptotic behavior of these most efficient tests under local alterna- 
tives and provide the announced expressions of their powers. In Section 3, we also study 
the asymptotic properties and powers (against FvML alternatives) of the famous test for 
uniformity over S k ~ 1 proposed in Rayleigh (1919). The finite-sample powers of the tests 
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are investigated in Section 5 by Monte Carlo simulations, and an appendix collects the 
technical proofs. 



2 The ULAN property of the concentration-location FvML 
model 

Let the data points Xi, . . . , X n be i.i.d. with common FvML density (1.1). We denote their 
joint distribution by P^ with $ := («, 8')' E IR^~ xS k ~ 1 . As announced in the Introduction, 
the objective of this section is to establish and prove the ULAN property of the sequence 
of FvML experiments |p^ n \i? E M<j~ x Such a sequence is ULAN (with contiguity 

rate n' 1 / 2 ) if, for any sequence i9 (n) E M+ x S^ 1 such that # (n) - # = 0{n~ l l 2 ), the 
likelihood ratio between P^™i) + _ 1/2 (n) and P^) allows a specific form of (probabilistic) 
Taylor expansion as a function of the perturbation E M x R fe . In view of the curved 
parameter set ~Rq x S k ~ l , it is clear that the local perturbations cannot be chosen 
without care, as they need to satisfy that + n _1//2 r^ n ^ remains in Rq x S k ~ 1 . Hence, 
writing 7» as (c< n \ (t^)')' with E E and t< n ) E we have the conditions 

+ n- l l 2 c {n) > (2.2) 

and 

= (8 {n) + n-VHWyiOV* + rT^H^) - 1 

= 2n- l ' 2 8 {n) 't {n) + n _1 (tW)'tW. (2.3) 

The second condition thus means that the perturbation t( n ) must belong, up to a o(n -1 / 2 ) 
quantity, to the tangent space to S k ~ 1 at 8^ n '. 

In order to ease readability, we introduce some notations. It can be shown that the 
projections X^fl, . . . , X' n are i.i.d. with common density f K (t) proportional to exp(«tf)(l — 
^2^(fc-3)/2 £ or ^ g Straightforward calculations then reveal that 

E[X,] = E[X$0 =: A fc («)* = ^-f— ^ 4=35— *. 

V J^e^l - t 2 ) — dt J 

showing that the parameter k is identified via the function -Afc(-). Note in passing that 

-Afc(-) = Ik/2{')/Ik/2—i(')- Similar manipulations yield 

Var[X$ = 4(k) = 1 - —A k ( K ) - (A k ( K )) 2 ; 



see Watson (1983) for more details on these results. We are now ready to state the ULAN 
property of the FvML concentration-location model. 

Proposition 2.1 The family jP^ | # £ Mq~ X is ULAN; more precisely, for any 

sequence G x S k ~ 1 such that — •& = 0{n~ 1 / 2 ) and any bounded sequence 
T ( n ) gixl' subjected to the conditions (2.2) and (2.3), we have 

,(») 



log 



dP v 

^(«) +Tl -l/2 T (n) 



Q ,9(") 



r (n) ) / Ait-^(rW) / r i? rW + op(l) 



and fcS^ n) —>■ A/fc_|_i(0,r^) under P^l) asn-> oo. The central sequence := ^A^ n ^ , ^Ajj 11 ^ 
is defined by 

n 

A (D(n) :=n -l/2^ W -^(«)) 
i=l 

and 

A (H)(n) ._ 

i=l 

uiii/i Sg(Xj) := (Xj — (X^<?)0)/||Xj — (X^)0||. T/ie associated Fisher information is given 
by 



i9 := 



rf o 
o r<"» 



where, putting Jk{i^) '■= — u 2 )f K (u)du, 

rj 5 := 1 - — - (A fc ( K )) 2 and rf := ^¥(1* " 



This proposition constitutes, for FvML distributions on the hyperspheres the 
desired extension (for FvML distributions) of Proposition 2.2 in Ley ei al. (2013a) where 
only the location parameter was taken into account. Note the diagonal structure of 
the information matrix; it is the structural reason why replacing by a root-ra consistent 
estimator has no asymptotic effect on inferential procedures focussing on k. 

Proof. We clearly need to circumvent the curved nature of the parameter space Mq~ xS k ~ 1 , 
more precisely of Fortunately, this has been achieved in Ley et al. (2013a) by proving 

ULAN rather for the spherical coordinates = h(rj) for some locally full rank chart h and 
then returning (via a result in Hallin et al. 2010) to the initial ^-parameterization. Thus, 
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thanks to the developments in Ley et al. (2013a), all we need to do here is to prove ULAN 
with respect to the "linear" parameters k and r} with r) G 

Our proof of that ULAN result relies on Lemma 1 of Swensen (1985)-more precisely, 
on its extension in Garel and Hallin (1995). Seven conditions need to be satisfied; we 
leave them to the reader, as they are easily obtained once it is proved that the mapping 

1/2 

(ft, 7/) ' — y / K /,(^)( x ) i s differentiable in quadratic mean. The latter differentiability in 
quadratic mean spells out as 



5 fc-i 



,1/2 

J K+s,h(r)+e) 



® K fJ,h(n) ^ 

s rad ^4%)( x ) 





( 




2 \ 


1 1 dcr(x) = o 




s 






{ 


e 


/ 



for s£l and e£l fc 1 such that k + s > and h(rf + e) G S k 1 . This result holds true 
once we have demonstrated the following three equalities: 

2 



W Is" {C(r, + e)W ~ Cir,)^) " fe^/^ x))'e) " da(x) = o(||e| 



Point (i) has been obtained in Ley et al. (2013a). Now for Point (ii), first note that, 
letting 6 := h(r} + e) G S^ 1 , we have 



c 1/2 x'0 

exp(Atx^/2) + (d K c^)exp( K x'0/2). 



2 

Therefore, the integral of Point (ii) can be rewritten as 



/ 



exp(Kx'6>) ( c^ +s exp(sx'0/2) - cj^ - s I c l J 2 J^- + d K c^ j j d<r(x) 



This integral can be bounded by c\S\ + C2S2 + C3S 2 53, where 



and 



<Si := / Ck, K +s exp(Kx'0) exp(sx'0/2) - 1 



SXI 



dcr(x), 



5 2 := 



,1/2 



C fcA ~ a5 " C fc(« ) exp(KX0) dff(x) 



,1/2 



1/2 



Since both exp(ftx'0) and (x'#) 2 exp(rex'0) are obviously integrable on S k ~ 1 , it follows 
from the derivability of the mapping k h-> cj^ that S3 is o(l) and that 52 is o(s 2 ). Now, 
the derivability of the mapping 1 1— > exp(t) at t = combined with Lebesgue's dominated 
convergence theorem directly entails that S\ is o(s 2 ). Putting the ends together, we have 
proved Point (ii). 

Finally, Point (iii) follows along the same lines since all quantities involved are differ- 
entiable and square-integrable. This concludes the proof. □ 

3 One-sample tests on the concentration parameter 

In this section, our focus lies on the one-sample testing problem T-Lq° : k = k,q for some fixed 
kq > versus : k 7^ kq (Section 3.1) and on the Rayleigh (1919) tests of uniformity 
(Section 3.2). In each case, we analyze the most efficient tests (Watamori and Jupp 2005 
and Rayleigh 1919, respectively) in the light of the Le Cam framework. Besides interesting 
optimality issues, this will allow us to obtain expressions for the powers of the considered 
tests under local alternatives. 

3.1 The one-sample score tests of Watamori and Jupp (2005) 

Based on the ULAN property in Proposition 2.1, the Le Cam asymptotic theory (see Le 
Cam 1986) paves the way towards constructing locally and asymptotically optimal tests. 
The optimality appearing in this section is the so-called maximin optimality. A test </>* is 
called maximin in the class C a of level-a tests for T~Lq against "Hi if (i) has level a and 
(ii) the power of (p* is such that 

inf E P [</)*] > sup inf E P [<j>\. 

Since k is the parameter of interest, locally and asymptotically optimal tests for T~Lq° are 
built upon n \ the K-part of the central sequence; see Le Cam (1986), Section 11.9, 
for details. More concretely, a locally and asymptotically maximin test rejects Hq° at 
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asymptotic level a whenever 



(A (DW )2 



>*0V' ■ (I) 

n{l-^l Ak {^)-{A k {K,)Y) 
exceeds the a-upper quantile of the chi-square distribution with 1 degree of freedom. 

(n) 

Unfortunately, Q Ko (6) is not (yet) a genuine test statistic since it still depends on the 
unknown value of 6. This problem can be solved by replacing 6 with a root-ra consistent 
estimator 6 in the central sequence Aj )(n) , whilst, of course, paying attention to the 
asymptotic effects of such a substitution. It is here that the ULAN property of the 
concentration-location FvML model comes in handy. Indeed, it directly entails (see again 
Le Cam 1986) that the FvML model is locally and asymptotically linear in the sense that 

AW n _ 1/2T(n) -AW=r,rW +0p (l) (3.4) 

under Pi"^ as n — > oo. Of course, the aim consists in using = {c^ n \ (t ("))')' with 
t( n ) = n 1 / 2 ^ ^ — 6) which satisfies condition (2.3); controlling this replacement however 
is not straightforward and requires a formal proof. The matters are simplified by the 
(already discussed) block-diagonality of the Fisher information matrix, which implies that 
the K-part of the central sequence is not influenced by a local perturbation of (similarly, 
the 0-part of the central sequence is not influenced by a local perturbation of k, hence the 
results in Ley et al. 2013a for the FvML case can be extended by estimating k). Hence, 
since our focus lies on and since, under the null hypothesis, k is fixed to kq, we only 

need to show by having recourse to the asymptotic linearity property (3.4) for A^^ that 
a replacement of with a root-ra consistent estimator (e.g., the sample spherical mean 
X/||X|| with X = n" 1 ^" =1 Xj) has no asymptotic impact on A^^ n \ which is achieved 
in the following proposition. 

Proposition 3.1 Let 6 ^ G S^ 1 be a root-n consistent estimator of 6 under P^ ■ Then, 

letting := ra" 1 / 2 Y^4=i ^fi> we have that T^?\ — Tg is op(l) under P^ as n — > oo. 



See the appendix for the proof. The resulting locally and asymptotically maximin test 
for testing %q° rejects the null (at asymptotic level a) when 



ra(l - ^A k (K ) - (A k (K ))i) 



exceeds the a-upper quantile of the chi-square distribution with 1 degree of freedom. The 
test statistic Q$ coincides with the score test proposed in Watamori and Jupp (2005). 
This, in passing, shows the local and asymptotic optimality property of the latter. The 

In) 

following result characterizes the asymptotic properties of Q K0 . 
Proposition 3.2 We have that 

(i) Qkq is asymptotically chi-square with 1 degree of freedom under Ug 6<S fe-iP^ g.; 

(ii) Qko is asymptotically non-central chi-square with 1 degree of freedom and non- 
centrality parameter (1 - ^-A k (K ) - (A k (K )) 2 )c 2 under Ufl 65 ft-iP^ +ri _ 1/2c(n) fl) 
(c := linin-s.oo c^ for c^ satisfying condition (2.2)); 

(Hi) the test cf>^J which rejects the null hypothesis as soon as Q$ exceeds the a-upper 
quantile of the chi-square distribution with 1 degree of freedom has asymptotic level 

(n) 

a under Ug^fc-iPW ^ and is locally and asymptotically maximin against local al- 

i. .T 

(k o +™- 1/2 c("),0)' 



ternatives of the form Ug eS k-iP^ 



Proposition 3.2 readily follows from Proposition 2.1, Proposition 3.1 and the celebrated 
third Lemma of Le Cam, and is hence left to the reader (if unclear, see the next section 
where we develop this argument for the Rayleigh 1919 test of uniformity). Note that 

(n) 

Proposition 3.2 readily yields the announced expression for the power of Q K0 under local 
alternatives of the form U flg5fe -iP }Z +n -i/2 c (n) ^ ( c := lim ?woo c (n) ): 



1 -R 



;?((l-^A fe ( K o)-(A fc ( K o)) 2 )c 2 )(^l;l-«)' 



where P x 2 ( 2 ) stands for the distribution function of the non-central chi-square distribution 
with v degrees of freedom and with non-centrality parameter z and xt i—a represents the 
a-upper quantile of the (central) chi-square distribution with v degrees of freedom. 

3.2 The Rayleigh (1919) tests of uniformity 

Let us now come to the Rayleigh test of uniformity. Within the FvML family, the boundary 
distribution obtained when k = is the uniform distribution. Unfortunately, the ULAN 
property of Proposition 2.1 does not hold for k = {inter alia because the location is 
not identified under the null of uniformity). Nevertheless, we show in this subsection that 



a study of the asymptotic local powers of the classical Rayleigh (1919) test which rejects 
the null hypothesis of uniformity %Q Dlf at asymptotic nominal level a when 

Q<2f := H|X|| 2 > Xl,i-a (3.5) 

can be performed using the Third Le Cam Lemma. It follows from (3.5) that in order to 
obtain local powers of Q^ if we have to study the asymptotic behavior of := n 1//2 X 
under local FvML alternatives since Q^ f = /c(T( n ))'T>). First, let 



:= log 



dP W 



(n-V2 c (™),0) 
Qr unif 



stand for the log-likelihood ratio between a FvML distribution with parameters (n _1//2 c ( - n ) , 6) 
and the uniform distribution on S k ~ 1 . Both distributions are clearly contiguous. Very 
simple computations yield 



) 

i=l 

for some constant C\? ( n ) which is o(l) as n — > oo under P^ f - The multivariate central 
limit theorem directly entails that the limiting distribution of ((T^)', A^ n ^) is a {k + 1)- 
variate Gaussian distribution with mean zero and covariance matrix (c := lim^^oo c^) 

ck~ l 0' c 2 k~ x 

under P^ n j f as n — > oo (this holds for any fixed 6 G S fe_1 ). Then the third Le Cam 
Lemma entails that the limiting distribution of is a /c-variate Gaussian distribution 
with mean ck~ l 6 and covariance matrix under P,™_i/ 2c (n) ^ as n -> oo. Wrapping 

up, we obtain the following result. 

Proposition 3.3 We have that 

(i) (Rayleigh 1919) Q^ f is asymptotically chi-square with k degrees of freedom under 

1/unif . 

(ii) Qy™- f is asymptotically non-central chi-square with k degrees of freedom and non- 
centrality parameter c 2 /k under U 0giS fc-iP^_ 1/2c(n) ^ (c := lim n _ > . 00 c^ for c^ sat- 
isfying condition (2.2)); 
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(n) 

The power of the Rayleigh test under local alternatives of the form Ug eS k-i¥ K .J_ 1/2c ^ ^ 



is given by (c := lim,; 



1 ~~ F xl{c 2 /k){xi;l-a)- 

Figure 1 right below shows power curves of the Rayleigh test for different values of the 
dimension k against local FvML alternatives. Note that the power of the Rayleigh test 
decreases as the dimension k increases. 



Power curves (test for a uniform distribution) 



dimension 
--- k=2 

k=3 

•— k=4 
— k=5 



~~ 1 — 
10 



Figure 1: Power curve (the local alternative is P[™-i/2/ c / 2 ) #)) °f 0un!f f° r dimensions k = 2, . . . , 5. 



4 Multi-sample tests on the equality of concentrations 

In this section, our focus lies on the multi-sample testing problem T-L^ om : n± = . . . = K m 
for m > 2 and K m > versus %^ om : 31 < i ^ j < m K{ ^ Kj. In other words, we 

are dealing with m(> 2) samples of i.i.d. data points Xji, . . . , Xj nj with common FvML 
distribution with concentration Kj and location 6i for alH = 1, . . . , m, and we are interested 
in determining whether or not these m samples have the same concentration parameters, 
without assuming equality of the mean directions <?j. As in the previous section, our way 
of proceeding consists in "re-discovering" the score tests of Watamori and Jupp (2005) 
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thanks to the ULAN property (which we shall adapt to the multi-sample case) and then 
unveiling new asymptotic results for these tests. 

Let us assume that the samples (Xjx, • • • , Xj n .), i = 1, . . . ,m, are mutually independent 
samples of i.i.d. random vectors; as already mentioned above, the rij observations Xjj, 
j = 1, . . . , m, in sample % are i.i.d. with common FvML density with concentration K{ and 
location 0j. We denote this time by P^l) the joint distribution of (Xn, . . . , X mrim ), with 
# m ) : = ( Kl ,...,K m ,e' 1: .. .,6' m y G (R^) m x (S k ~ 1 ) m . In order to be able to state our 
results, we need to impose a certain amount of control on the respective sample sizes rij, 
i = 1, . . . , m. This is achieved via the following 

Assumption A. Let n = YUiLx n i- For all i = 1, . . . , m, the ratio r^ := rii/n converges 
to a non-zero constant rj as n — > oo. 



A direct consequence of Assumption A is that the specific sizes rii become somehow ir- 
relevant; hence, in what follows, we simply use the superscript ( n ) for the different quan- 
tities at play and do not specify whether they are associated with a given nj. Now, let 
diag(A i, . . . , A m ) stand for the block-diagonal matrix with blocks A.\, ■ ■ ■ , A m , and use 
the notation i/W := diag^j™ , ), where '■= diag((r^) -1 / 2 , . . . , (r^) -1 / 2 ) and 
:= diag((r^ n ^)~ 1//2 Ifc, . . . , (r^)" 1 / 2 !*;)- As in the one-sample case, we only consider 
perturbations = {cf\ . . . , (t[ n) )', . . . , (t&ty)' G R m x (R k ) m such that, for any 
t? (m) G (M+) m x (cS^- 1 )™, # (m) + n - 1 /V n M ri ) remains in (M^) m x (5 fc " 1 ) m (this is simply 
an adaptation of the conditions (2.2) and (2.3)). This readily leads us to the following 
multi-sample version of Proposition 2.1, whose straightforward proof is omitted. 



Proposition 4.1 Let Assumption A hold. Then the family |P^1) I # (m) G ( M o ) m x ( S 
is ULAN; more precisely, for any sequence (i?M)( n ) £ (R+) m x (S^ 1 )" 1 such that (# m ))( n ) 
any bounded sequence as described just before, 

/dP (n) \ 
bg (<M y W . M . = (tW) - A (» 1))w _ £ (tW)WW + op(1 , 

V dF (^(™))(«) / 

and ^|^(m)-)(n) ~^ Mn(fe+l) (0, r^( m ) ) under P^l } as n oo. TTie central sequence 
A<<> , == ((AjS?)', (A™,"')')' — AJR? == ( W)' - • • • (A^'-'VY mi 
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== ((A»'i»'»»)',.., (a«»-)<»))')', ,., ^ h 

^ :=n- 1 / 2 ^(X^-A fc (^)) 

3=1 

and 

rii 
3=1 

/or all i = 1, . . . ,m, with S^X^-) := (Xjj — (X^0j)0j)/||Xij — (X^-0j)0j||. TTte associated 
Fisher information is given byV^ im) := diag(r^ ro) ^J^), whereT® m) := diag(rj£Jj . . . ,rj£$) 
with rjg, := l-^lAfcCKiMAfcta)) 2 for alii = l,...,m and where r™,, := diag(rJJ#, • • • ,r$^ 



/or all i = 1, . . . ,m. 



As for the one-sample case in Section 3, we use the ULAN property to construct a 
locally and asymptotically optimal test for the homogeneity of concentrations. Here, the 
underpinning optimality concept provides the so-called most stringent test for 1~C^ ora . A 
test (j>* is called most stringent in the class of level-a tests C a for testing %q against T~L± if 
(i) (jf has level a and (ii) is such that 

sup ?>*(P) < sup r^(P) G C a , 
Pe-Hi PeWi 

where r^ (P) stands for the regret of the test (fio under P G Hi defined as r^ (P) := 
[sup^ gCa Ep [</>]] — Epf^o], the deficiency in power of <po under P compared to the highest 
possible (for tests belonging to C a ) power under P. 

Letting l m := (1, . . . , 1)' G R m , the null hypothesis U^ ora can be rewritten as U^ om : 
(ki , . . . , K m )' G A4(l m ), where A4(A) stands for the linear subspace spanned by the 
columns of A. Following Le Cam (1986), a locally and asymptotically most stringent test 
rejects the null hypothesis om at asymptotic level a when (writing k for the common 
value of K U ...,K m under the null, D k := 1 - ^A k (n) - (A fe ( K )) 2 , T (n) := (i/^)-* 1 ™ 
and Xj := n" 1 YLlLi x 



ij > 



£r K (tf' i X 4 ) a --f£n^X i J 



(i)W 
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exceeds the a-upper quantile of the chi-square distribution with m — 1 degrees of freedom. 
As for the one-sample case, the statistic Q^^S^"^) is not (yet) a genuine test statistic 
since it still depends on the unknown location parameters 6\, . . . ,6 m and moreover on the 



quantity D^. The replacement of the location parameters with root-ra consistent estimators 
(e.g., &i ^ = Xi/||Xi||, . . . ,0 m = X m /||X m || with X,-/||Xj|| the jth intra-sample spherical 
mean) will not have any asymptotic impact on Q£ m (# (m) ), see Proposition 3.1. As 
concerns the quantity D k , it can be estimated consistently by Dk := 1 — ^j^-Ak(k) — 
(A k (k)) 2 , where, putting «i = A" 1 (W^W) , . . . , k m = ^(IIX^II), « : = YZi^ k i- The 

(n) 

resulting locally and asymptotically most stringent test </>jj om rejects the null hypothesis 
T-Lf^ om at asymptotic level a whenever 

exceeds the a-upper quantile of the chi-square distribution with m — 1 degrees of freedom. 
Again, the test statistic Q^ m coincides with the score test proposed in Watamori and 
Jupp (2005) which is therefore locally and asymptotically most stringent. The following 

in) 

result characterizes the asymptotic properties of Q^om un der the null and under a sequence 
of local alternatives. 

Proposition 4.2 Let Assumption A hold. We have that 

(i) <5jj^ m is asymptotically chi-square with m — 1 degrees of freedom under 7-L^ om ; 

(ii) letting c = (ci,...,c m ) := lim n _ 5 . 00 (c^ l \ . . . , c^)', Q^ m is asymptotically non- 
central chi-square with m — 1 degrees of freedom and non-centrality parameter 

m / m \ ^ 

D * E c '" Ev 7 ^ (4.6) 
%=\ \%=\ ) 

under U (( ; r . W(6( ^i )m U K£R + P ( ")... j ^ li ...^y^-i/2 v (») T (»); 

(Hi) the test (pulm which rejects the null hypothesis as soon as Q^ m exceeds the a-upper 
quantile of the chi-square distribution with m — 1 degrees of freedom has asymptotic 
level a under l-l^ om and is locally and asymptotically most stringent against local 
alternatives of the form U^^y^-i)™ U KeR + P£)... j ^ 1) ...^ ) , +n -i/v.0TM ■ 
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See the appendix for a proof. Note that, when all quantities c\ {rf ) 1//2 (and hence 

— 1/2 

the limit ar i ) are equal, we are still under the null; this is well translated by the fact 
that then the non-centrality parameter in (4.6) equals zero. Proposition 4.2 also readily 

(n) 

yields the announced expression for the power of Qjjom un der local alternatives of the form 

(n) 

U (ei,...,e; n )'6(5 fc - 1 )'" u k6 r+ P ( K ,..., K ,0' 1 ,...,e; n )'+n-i/2 1 /(")r(«) : 

1 " F x^-i(o fe [EIL 1 cf-(Er=i^) 2 ]) (x ^- 1 ; 1 - a) - 
We conclude this section by attracting the reader's attention to the fact that this multi- 
sample problem here complements, for the FvML case, the ANOVA study in Ley et 
al. (2013b). 

In the next section, we study the finite-sample powers of the tests constructed here via 
Monte Carlo simulations. 



5 Monte Carlo simulations 

Since Watamori and Jupp (2005) do not examine the finite-sample performances of their 
score tests, we will do so via a Monte Carlo study in this section. More precisely, we 
shall concentrate on the multi-sample case and hence complement the theoretical powers 
provided at the end of the previous section by a simulation study. However, before start- 
ing this analysis, we will first verify numerically the asymptotic powers obtained for the 
Rayleigh (1919) test. 

5.1 Power curve of the Rayleigh (1919) test 

The aim of this subsection is to corroborate Proposition 3.3 and the ensuing power curves 
by showing that empirical power curves do converge to the theoretical ones. To do so, 
we generated N = 5,000 independent replications of circular FvML (hence, in fact, von 
Mises) random vectors 

X c;i , c = 0, ...,10, i = l,...,n, 

with concentration n _1 / 2 c/2 and location 6 = (1,0)'. The vectors Xo ; i represent the null 
hypothesis while the vectors X c; j for c = 1, . . . , 10 are (increasingly) under the alternative. 
The results using sample sizes n = 50, n = 200 and n = 500 are plotted in Figure 2. They 
clearly confirm the theoretical power curves and hence Proposition 3.3. 
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Power curves (test for a uniform distribution) 




// 



/ 
/ 



~~ I — 

10 



Figure 2: Power curve of 4>^ if for k = 2 and various sample sizes n = 50, n = 100 and n = 200. 
The "n=inf curve is the true (theoretical) power curve (obtained using Proposition 3.3). 



5.2 Finite sample behavior of the most stringent test for the homogene- 
ity of concentrations 

(n) 

In this subsection, we investigate the finite-sample behavior of the test </>jjo m for the 
homogeneity of the concentrations. We generated N = 5, 000 independent replications 
of three pairs (m = 2) of mutually independent samples (we considered two designs; first 
with respective, and relatively small, sizes n\ = 100 and ri2 = 150 and then with respective 
moderate sample sizes n\ = 500 and ri2 = 500) of circular random vectors 

and X fc; 2j 2 , £ = 1,2,3, c = 0, . . . , 10, ji = l,...,m, i = l,2, 

with FvML densities such that 

(i) Xijijj and Xio ; 2j 2 have a common concentration k = 1 and locations 6\ = (1,0)' 
and f?2 = (—1, 0)'. Then for c = 1, . . . , 10, the Xi c; 2j 2 's have concentration 1 + c/10 
and still locations 0i = (1,0)' and 2 = (-1,0)'. 

(ii) X2;ij 1 and X2o ; 2j 2 have a common concentration k = 5 and locations B\ = (1,0)' 
and $2 = (—1, 0)'. Then for c = 1, . . . , 10, the X2 C; 2j 2 's have concentration 5 + c/10 
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and still locations 0i = (1,0)' and 6 2 = (-1,0)'. 

(iii) X3-1,! and X3o ; 2j 2 have a common concentration k = 10 and locations 6\ = (1,0)' 
and 02 = ( — 1, 0)'. Then for c = 1, . . . , 10, the X3 C; 2j 2 's have concentration 10 + c/10 
and still locations Q x = (1,0)' and 2 = (-1,0)'. 

For all £ = 1,2,3, the random vectors X^-i- and X£o ; 2j 2 are un der the null hypothesis. 
Then, for c = 1, . . . , 10, the random vectors X^i^ and X^q^ are (increasingly) under the 
alternative. The results are plotted in Figures 3 and 4. Inspection of the Figures reveals 

in) 

that the test </>jj om reaches the nominal level constraint even with small sample sizes. The 
power of the test decreases when the concentration increases. 



Power curves (test for homogeneity) 



concentration 

kappa=1 

kappa=5 

kappa=10 



Figure 3: Empirical power curves of 4>j^ m for k = 2, various concentration parameters (k 
1, 5, 10) and sample sizes n\ = 100 and n 2 = 150 
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Power curves (test for homogeneity) 



concentration 

kappa=1 

kappa=5 

— kappa=10 



I — 
10 



Figure 4: Empirical power curves of (f%^ m for k = 2, various concentration parameters (k 
1, 5, 10) and sample sizes n\ = 500 and n-z = 500 



A Appendix: Proofs 

-(n) 

Proof of Proposition 3.1 First note that, combining the fact that 6 and 6 have norm 
1 with the delta method applied to the mapping x h-» x/||x||, we have that 

n v\e-o) = - 1/2 (J|-m) 

= (I fc -00V /2 (*-0) + op(l) (A.7) 

under as n — > oo. Now, the law of large numbers, the fact that E[XJ = E(X'-^)^ and 
(A. 7) readily entail that 

T {n)_ T (n) = xV/ 2 (0-0) 

= (EpCi])'(I fc - d6')n l l 2 {6 -6) + o P (l) 
= E(X[0)0'(I k - ee')^' 2 ^ -&)+ op(l) 
= o P (l) 

under as n — > oo, which is the desired result. 
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□ 



Proof of Proposition 4.2 We first show Point (i). Since 0i is a root-n^ consistent 
estimator of 0j, following the proof of Proposition 3.1, we readily have that 

XjnJ /2 (di - A*) 

is op (1) under PZ } as n — >• oo. As a direct consequence, letting i? := (d 1 ,...,d m , 
we obviously have that 

A^, (n) - Ag2,W = -(r^n^A^k) - A k (n)) + o P (l) 

under P$ with t? m G 7$ om as n — )• oo. It follows directly that from the delta method 
and from Mardia and Jupp (2000, p. 199) that 



A^ff-AgS? = -T(™)n 1 /2 ( ^ ( ^)_A fc ( K ))+ p(l) (A.8) 

= -T^A'^n^ik- k)+op(1) 

: -T( n ) J D fc n 1 / 2 (K-K)+o P (l) 

= -rWjWn^- K ) +0p (l) 



still under P^„) as n — >• oo with #( m ) such that all components are equal (that is, we 
are under T-L^ om ). Therefore, defining 

(TiL) 1 := (TjL)- 1 -TW ((TW)TN T^V 1 ( T(n) )'» 



the consistency of ^ together with (A.8) entails that 

^Hom — ^HomV* 7 J 

- fA (I) vrr (I) ^a (i) 

= ( A JL -r^ ) T (n) - 1/2 («-«)))'(^l)) ± (A^ ) -r^ ) THn^(R- K ))) + 0p( l) 

= (Ail)) / (r^ ) )Xl)+°p( 1 ) 

= Q£ m (tf (m) )+°p(i) 

under P^™^) as n — )• oo. Then, Point (i) directly follows from the asymptotic normality of 
A^(L) m Proposition 4.1 and from the fact that (P^m^^^m) 1S idempotent with trace 
m — 1. Point (ii) follows by applying Le Cam's third Lemma as in Proposition 3.3. For 
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Point (iii), see Le Cam (1986) or Hallin and Paindaveine (2008) for a more recent reference. 
□ 
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